Ab018315.Emhuml

!!NA_SEQUENCE 1.0

ID AB018315 standard; RNA; HUM; 3879 BP. 

AC AB018315; 
XX 

SV AB018315.1 
XX 

DT 17-NOV-1998 (Rel. 57, Created) 

DT 02-SEP-1999 (Rel. 60, Last updated, Version 2) 
XX 

DE Homo sapiens mRNA for KIAA0772 protein, complete cds. 



OS Homo sapiens (human) 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;

OC Eutheria; Primates; Catarrhini; Hominidae; Homo. 
XX 

RN [1] 

RP 1-3879 

RA Ohara O., Suyama M., Nagase T., Ishikawa K., Kikuno R.;



RT ;

RL Submitted (08-OCT-1998) to the EMBL/GenBank/DDBJ databases.

RL Osamu Ohara, Kazusa DNA Research Institute, Laboratory of DNA Technology; 

RL Yana 1532-3, Kisarazu, Chiba 292-0812, Japan (E-mail:cdnainfo@kazusa.or.jp,

RL Tel:+81-438-52-3913, Fax:+81-438-52-3914) 
XX 

RN [2] 

RA Nagase T., Ishikawa K., Suyama M., Kikuno R., Miyajima N., Tanaka A.,
RA Kotani H., Nomura N., Ohara O.;

RT "Prediction of the coding sequences of unidentified human genes. XI. The 

RT complete sequences of 100 new cDNA clones from brain which code for large

RT proteins in vitro"; 

RL DNA Res. 5:277-286(1998). 

XX 

DR SPTREMBL; Q9Y4B8; Q9Y4B8. 



XX 
KW 
XX 



XX 
FH   Key             Location/Qualifiers
FH
FT   source          1..3879
FT                   /db_xref="taxon:9606"
FT                   /sequenced_mol="cDNA to mRNA"
FT                   /sex="male"
FT                   /organism="Homo sapiens"
FT                   /clone_lib="pBluescriptII SK plus"
FT                   /dev_stage="adult"
FT                   /tissue_type="brain"
FT                   /clone="hkOSlig"
FT   CDS             148..1554
FT                   /db_xref="SPTREMBL:Q9Y4B8"
FT                   /gene="KIAA0772"
FT                   /product="KIAA0772 protein"
FT                   /protein_id="BAA34492.1"
FT                   /translation="MNGEEEFFDAVTEANQKVTGHIDLDTSKNNRIGKTGERPSQENGI
FT                   QKHRTSLPAPMFSRSDFSVWTILKKCVGLELSKITMPIAFNEPLSFLQRITEYMEHVYL
FT                   IHRASCQPQPLERHQSVAAFAVSAVASQWERTGKPFNPLLGETYELIREDLGFRFISEQ
FT                   VSHHPPISAFHSEGLNHDFLFHGSIYPKLKFWGKSVEAEPRGTITLELLKHNEAYTWTN
FT                   PTCCVHNVIIGKLWIEQYGTVEILNHRTGHKCVLHFKPCGLFGKELHKVEGHIQDKNKK
FT                   KLFMIYGKWTECLWGIDPVSYESFKKQERRGDHLRKAKLDEDSGKADSDVADDVPVAQE
FT                   TVQVIPGSKLLWRINTRPPNSAQMYNFTSFTVSLNELETGMEKTLPPTDCRLRPDIRGM
FT                   ENGNMDLASQEKERLEEKQREARRERAKEEAEWQTRWFYPGNNPYTGTPDWLYAGDYFE
FT                   RNFSDCPDIY"
XX 



SQ Sequence 3879 BP; 978 A; 913 C; 1015 G; 973 T; 0 other;

Length: 3879 August 11, 19100 11:15 Type: N Check: 3315 ..


1 CCGAGGGACC CGCGTCCAGA TCTTCAGTGT CTATTGGATT TTTCCAAGAG
51 AAAGTTTGTA AAATTCCTTA CACTGTAGAT GTGGATCAGA TACGATGATT
101 CAGTAGAAGA GCACATGTCA GGGGCAGTGG AGGCTGGCTG CTGAAGGATG
151 AACGGAGAGG AAGAATTCTT TGATGCCGTC ACAGAGGCAA
201 CACGGGAATG ATTGACTTAG ACACCAGCAA AAATAATAGG ATTGGGAAAA
251 CTGGGGAGAG GCCCTCTCAA GAGAACGGAA TTCAGAAACA CAGGACATCG
301 CTGCCGGCTC CCATGTTCAG CAGAAGCGAC TTCAGCGTGT GGACCATCCT
351 GAAGAAGTGT GTTGGCCTGG AGCTGTCCAA GATCACGATG CCAATCGCCT
401 TCAACGAGCC TCTGAGCTTC TTGCAGCGGA TCACGGAGTA CATGGAGCAC
451 GTGTACCTCA TCCACAGGGC CTCCTGCCAG CCCCAGCCCC TGGAGAGGAT
501 GCAGTCTGTG GCTGCTTTTG CTGTTTCGGC TGTGGCTTCC CAGTGGGAGA
551 GGACCGGCAA ACCATTTAAT CCACTCTTGG GAGAAACGTA TGAATTAATC
601 AGGGAAGATT TAGGATTCAG ATTTATATCG GAACAGGTCA GTCACCACCC
651 CCCCATCAGT GCGTTCCACT CGGAAGGTCT CAACCATGAC TTCCTGTTCC
701 ATGGCTCCAT CTACCCCAAG CTCAAGTTCT GGGGCAAAAG CGTGGAGGCG
751 GAGCCCCGAG GCACCATCAC CCTGGAGCTG CTCAAACATA ATGAAGCCTA
801 CACCTGGACC AACCCCACCT GCTGCGTCCA CAACGTCATC ATCGGGAAGC
851 TGTGGATAGA GCAGTATGGG ACAGTGGAGA TTTTAAACCA CAGAACTGGA
901 CATAAGTGTG TGCTTCACTT TAAACCGTGT GGATTATTTG GAAAAGAACT
951 TCACAAGGTG GAAGGACACA TTCAAGACAA AAACAAAAAG AAGCTCTTTA
1001 TGATCTATGG CAAATGGACG GAATGTTTGT GGGGCATAGA TCCTGTTTCG
1051 TATGAATCCT TCAAGAAGCA GGAGAGGAGA GGTGACCACC TGAGAAAGGC
1101 CAAGCTGGAT GAAGACTCCG GGAAGGCTGA CAGCGACGTG GCTGACGACG
1151 TGCCTGTGGC CCAGGAGACC GTGCAGGTCA TTCCTGGCAG CAAGCTGCTC
1201 TGGAGGATCA ACACCCGGCC CCCCAACTCT GCCCAGATGT ATAATTTCAC
1251 CAGTTTCACT GTGAGCCTCA ACGAGCTGGA GACAGGCATG GAGAAGACCC
1301 TGCCACCCAC GGACTGCCGC CTGCGCCCTG ACATCCGCGG CATGGAGAAT
1351 GGCAACATGG ATCTGGCCAG CCAGGAGAAG GAGCGGCTGG AGGAGAAGCA
1401 GAGAGAAGCA CGGAGGGAGC GGGCCAAGGA GGAGGCAGAG TGGCAGACGA
1451 GGTGGTTCTA CCCAGGCAAT AACCCCTACA CTGGGACCCC CGACTGGTTG
1501 TATGCAGGGG ATTACTTTGA GCGGAATTTC TCCGACTGCC CAGATATCTA
1551 CTGAGGGCCT GGAGGGGCCT GGGGCCCGGG ACCGGAGGCT GACGAGGCTG
1601 GACTTCCTCG AGTGGCCATT GTGAGCCTCG TCACAGCAGA AACCAACTTT
1651 TCTAACGACT GAGTTCGCGG AGATAGCATC ATCCCTGATC AAGGATGTAA
1701 TTCTAATTAA CTGTTGATTG CCAAACATTT CACTCTGCTG TGCCGTCTCT
1751 TCATAAAGCT TCACTTGGGA TCATCGTCTT CATTAAGGTT TCAACAGGGA
1801 AATTCTTCAC GGCGCCCTTT TATGTGGCAG AAATCAGCTG GGGCTTGTTT
1851 AGCTTCCAGC ACACTCTCAG TCATAGCATG TGTAGCTAAA GGAAGTAATG
1901 GGAAGGGGTT CATGTTCTCT TTATAATGCA GTGGCAAAAG GTTCTGAAAG
1951 CCTTTTAAAC TCGAACCAGT GGGGGAAAGA TGGATCTTGA AGCTAATCCT
2001 GCAGAGAGTT TTATAGAGGC CAGGGATTGC CTTCTAAATT ATGATAAAAC
2051 AGAAGTGAAG AGTTTCAGAG CATCAGATTG AGTGAAAAGT TGTCAGATTC
2101 TGTATTTTTT AACAATCTTC AATAATGTAA AGATTACTTT TAAAATATTT
2151 AAGTTAAAAC TACTTGAATA GTATTTTGCT GAAGAGCAAG ATATGCATTA
2201 ATCACCGGTT TTATACTGTC CAAAATGAAG CATCCCCGTG ACAAACCAGA
2251 GTGGGCAGAA GCATCGAGAG CGTGACAGGA AATCCCAAGA CTGCTTCCGC
2301 CTCAGAGGCG TCCCGGCTGC GATTCGCTGC CCTGTTGTCA GTGAGGCCTG
2351 GCTGTCACCG CACACCGCGT CCGTGTCTCC AGGGGGTTCC TTTCTTCTCA
2401 CACGTCGCGT GTACCCATAG CACTCTTGTG TTTCTGTTTT TCCCAGTATG
2451 CATGTTTAAA ATAGAAGTGA CAAGAATCAC ATCCGGTTGT GTCCTGTGGG
2501 AGGGTCAGAG GCAGAATCTA CTTACAGTGG TGTAATTAAA Gi IA1 1 1AAU
2551 CAAAAATAGG TATGTGTCCA TCTCAGCATT CACCTTTATC AAGTGACTGA
2601 TTTTTTTTTC TTTTCTTTCC TTTTTTTTTT TTTTTTGAGA CGGAGTTTCA
2651 CTCTTGTTGC CCAGGCTGGA GTGCAATGGC ATGATCTCGG CTCACCGCAA
2701 CCTCCGCCTC CCGGGTTCAA GCGATTCTCC TGCCTCAGCC TCCCAAGTAG
2751 CTGGGATTAC AGGCACGCGC CACCACGCCT GGCTGATTTT GTATTTTTAG
2801 TAGACACGGG TTTTCACCAT GTTGGTCAGG CTGGTCTCAA ACTCCCGACC
2851 TCAAGTAGTC TGCCTGCCTC AACCTCCCAA AGTGCTGGGA TTACAGGCGT
2901 GAGCCACTGC GCCTGGCCGT GACTGATTTT TTTTCATGTA GAATTGTCAA
2951 CACGAGAGAT CACAGTGGAG CACTTTGAAA GACCGTCGGT TGTGTGCACG
3001 CACGCACACA CTCATGCACA CGCTGACACG CGGTTGCATG GAGTCCAGGT
3051 TACTCAGGCC GGCACTTCTG AGTGACAGGT GCCACCTGCG TGTGTCTTGG
3101 CGTCCACATC ACACCTGTGA CGGAAGCACT TCTGGAAGTG AACACTCGTT
3151 TTGAAAGCTT GATTTTGTAG CTTTGGAAGC TGGAAGCGAT GGTGTTTGGT
3201 GCCGAGTCCT GTGTCATCCT CGGGGCCTAT GAGCTCCGTA CCAGCCACTC
3251 AAAAGTGTCT GAACAGAACC GCTCCGTGAC TGGTAGCTGG GTCTGAGGAT
3301 TCAGGATTGT GGCGTTATTC AAAGAGGAGA CTTTGAAATT CCCCGATGGC
3351 TGGAATGTGG AGCCCAGGTG CCTCTGGTGG AGGGTCATCT GCTTTTCCAG
3401 ACTGTGGTTG TGAACCGGCT CCTTCTCCAA GAAAGGTTGC AAGCTAGAAC
3451 ATCCAGAGGT GAGACTCAGA CACATTGAAA GTGACTGCAT TTAGGGAGGT
3501 TTAACGAGTT CTTACTGATC ATTCCACTTG TTACTGGTTA AGATAATTTG
3551 CCCACGGGTT TGTTTCCAAG TCCTCTTCTA GGACCAGGCT CCTGGTATTT
3601 CAGGGGCTGG TTGGCTGCAC AGACAGCCCC TCTTCTGCTG TCCTTGAGGA
3651 CAGACACCAA ACCAGAGGTG GAGGAAGAAC GGTAGGAAGG CTGATGGCAA
3701 AAGCGGCTGT GTGTCGAGGT TATTTTAACT TTTTACTACT TTTTGTTACT
3751 GTTTCTGCAA ATgI^ACAC ATAAACCATG ACCTAACT^ TGTCACCTTG
3801 GATATCTATT GAATGTTAAA CATCTCTAAT AAAGATGGCC ACCACTTAAT
3851 GTGTGGAAAG TGATGGCCTT CTCGTGGGC



